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We establish an operational theory of coherence (or of superposition) in quantum systems, by 
focusing on the optimal rate of performance of certain tasks. Namely, we introduce the two basic 
concepts — “coherence distillation” and “coherence cost” in the processing quantum states under 
so-called incoherent operations [Baumgratz/Cramer/Plenio, Phys. Rev. Lett. 113:140401 (2014)]. 
We then show that in the asymptotic limit of many copies of a state, both are given by simple single¬ 
letter formulas: the distillable coherence is given by the relative entropy of coherence (in other words, 
we give the relative entropy of coherence its operational interpretation), and the coherence cost by 
the coherence of formation, which is an optimization over convex decompositions of the state. An 
immediate corollary is that there exists no bound coherent state in the sense that one would need 
to consume coherence to create the state but no coherence could be distilled from it. Further we 
demonstrate that the coherence theory is generically an irreversible theory by a simple criterion that 
completely characterizes all reversible states. 

PACS numbers: 03.65.Aa, 03.67.Mn 

Introduction .— The universality of the superposition 
principle is the fundamental non-classical characteristic 
of quantum mechanics: Given any configuration space X, 
its elements x label an orthogonal basis |a;) of a Hilbert 
space, and we have all superpositions XBx^M 2 ") as the 
possible states of the system. In particular, we could 
choose a completely different orthonormal basis as an 
equally valid computational basis, in which to express the 
superpositions. However, often a basis is distinguished, 
be it the eigenbasis of an observable or of the system’s 
Hamiltonian, so that conservation laws or even superse¬ 
lection rules apply. In such a case, the eigenstates |a;) are 
distinguished as “simple” and superpositions are “com¬ 
plex” . Indeed, in the presence of conservation laws, struc¬ 
tured superpositions of eigenstates can serve as so-called 
“reference frames” .which are resources to overcome the 
conservation laws [3-Q. Based on this idea, Aberg 0] and 
more recently Baumgratz et al. [BJ have proposed to con¬ 
sider any non-trivial superposition as a resource, and to 
create a theory in which computational basis states and 
their probabilistic mixtures are for free (or worthless), 
and operations preserving these “incoherent” states are 
free as well. This suggests that coherence theory can be 
regarded as a resource theory. 

Let us briefly recall the general structure of a quantum 
resource theory (QRT) and basic questions that be asked 
in a QRT through entanglement theory (ET), which is a 
well-known QRT. For our purposes, a QRT has three 
ingredients: (1) free states (separable state in ET), (2) 
resource states (entangled state in ET), (3) the restricted 
or free operations (LOCC in ET). A necessary require¬ 
ment for a consistent QRT is that no resource state 
can be created from any free state under any free op¬ 


eration. The QRT is then the study of interconversion 
between resource states under the restricted operations. 
The pure resource states play a special role and are much 
more preferable because usually they are used to circum¬ 
vent the restriction on operations (Bell states are used 
in teleportation to overcome LOCC operations, for in¬ 
stance). So the conversions between pure resource states 
and mixed ones are a major focus of QRTs. A standard 
unit resource measure can be constructed if the conver¬ 
sions between pure states are asymptotically reversible 
(entropy of entanglement of a pure entangled state in ET, 
with a Bell state as the unit). Then there are two basic 
transformation processes that are well-motivated: one is 
the so-called resource distillation, that is the transforma¬ 
tion from a mixed resource state to the unit resource, 
and the other is resource formation, that is the reverse 
transformation from the unit resource to a mixed state 
(entanglement distillation and entanglement formation in 
ET). Because of the reversibility in the pure state con¬ 
version we need not worry about what kind of pure state 
is the target, as they are equivalent up to the transfor¬ 
mation rate between them. Thus, two well-motivated 
quantities arise from the two basic processes, distillable 
resource and resource cost, which have a clear opera¬ 
tional interpretation (distillable entanglement and en¬ 
tanglement cost in ET). The principal objective of the 
theory is the characterization of these two quantities. 
This is often a highly complex problem, but resource 
monotones yield various limits on possible transforma¬ 
tions and achievable rates. Another basic question in 
any QRT is to ask whether the theory is irreversible or 
not. If the conversion between pure states is reversible, 
then the reversibility problem is reduced to the question 
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whether or not the optimal conversion rate in the forma¬ 
tion process is equal to that in the distillation process. 
If a QRT is reversible, a unique resource measure exits, 
quantifying the conversion rate between different states, 
so that everything about possible resource transforma¬ 
tions becomes clear and simple. However, if a QRT is 
irreversible, the phenomena are ample and further inter¬ 
esting questions can be asked, for example whether there 
exist so-called bound resource states as an analogue of 
bound entangled states in ET, in the sense that 

from them no resource can be distilled, but for which, in 
order to create them, nonzero resource is required. Sev¬ 
eral QRTs were constructed along these lines, some them 
indeed irreversible: entanglement theory (with respect to 
LOCC) [HSi thermodynamics (w.r.t. energy-conserving 
operations and thermal states) 10, 11], reference frames 
(w.r.t. group-covariant operations) [12j, etc. 

In this Letter, we establish an operational coherence 
theory in the framework proposed in [Hj and 0. Namely, 
first we show that the conversion between the pure coher¬ 
ent states is asymptotically reversible, so that the stan¬ 
dard unit coherence measure exists. Then we introduce 
the basic transformation processes - coherence distilla¬ 
tion and coherence formation -, from which two basic 
coherence measures naturally arise: distillable coherence 
and coherence cost, with operational interpretations. Re¬ 
markably, both are given by information theoretic single¬ 
letter formulas that hence make these quantities com¬ 
putable. These results in turn allow us to formulate a 
simple criterion to decide whether a given state is irre¬ 
versible or not and to show that there is no bound coher¬ 
ence. Although the main results are in the asymptotic 
setting, we also get the single copy conversion of pure 
states along the way. In the following, we state and dis¬ 
cuss our results carefully, while all proofs are found in 
Appendix B. 

Coherence as a resource theory. — We follow the frame¬ 
work of coherence theory by Baumgratz et al. Q. Let 
{|*)} be a fixed basis in the finite dimensional Hilbert 
state. The free states called incoherent states are those 
whose density matrices are diagonal in the basis, being of 
the form 'Yh i Pi\i){i\ where pi is a probability distribution 
and the set of incoherent states is denoted as A. The re¬ 
source states called coherent states are those not of this 
form. Quantum operations are specified by a set of Kraus 
operators {Kg} satisfying K.\Kg = 1; a quantum op¬ 
eration can have many different Kraus representations. 
The free operations, called incoherent operations (IC) are 
those for which there exists a Kraus representation {Kg} 
such that -— ^\ K K(f J Kf £ A for all i and all p £ A. 

Such restriction guarantees that even if one has access 
to individual measurement outcomes l of the instrument 
{Kg}, one cannot generate coherent states from an inco¬ 
herent state. Under this restriction, each Kraus operator 
is easily seen to be of the form Kg = c (*)|j(*))(*l where 


j (i) is a function from the index set of the computational 
basis, and c(i ) are coefficients; we call such Kraus opera¬ 
tors incoherent, too. If not only Kg but also K\ is inco¬ 
herent, we call it strictly incoherent , and the correspond¬ 
ing quantum operation a strictly incoherent operation. 
Strict incoherence can be characterized by the function 
j{i) above being one-to-one. Another equivalent form of 
a general incoherent Kraus operator is K = JA |j)(y 7 1, 
with | 7 j) £ span{|i) : i £ Sj} for a partition [d] = |J ; »Sy. 

We will be mainly concerned with state transforma- 

IC 

tions, for which we introduce some notation: p \—> a 
means that there exists an incoherent operation T such 
that a = T(p) = (T^LQp.Aj; h the operation is strictly 

incoherent, we write p \ — % a. If the transformation 
is obtained probabilistically, i.e. if a oc KgpK\ and 

Tr KgpKj ^ 0 for some £, we write p a and p i-—£ a, 
for probabilistic incoherent and probabilistic strictly in¬ 
coherent transformations, respectively. 

When we consider composite systems, we simply de¬ 
clare as incoherent the tensor product basis of the lo¬ 
cal computational bases; the incoherent operations are 
then defined with respect to the tensor product ba¬ 
sis. Notice that there are several special transforma¬ 
tions that are incoherent: phase and permutation uni- 
taries. In particular, in a composite system of two qu- 
dits, CNOT : |*)|j) \i)\i + j mod d) is an incoherent 

operation, as it is simply a permutation of the tensor 
product basis vectors [13j, [l4|] . In Appendix C we discuss 
a slightly more general and flexible model. 

We define the decohering operation A(p) = 
^A(*|p|i}|i)(i|, i.e. the diagonal part of the density ma¬ 
trix. This makes the incoherent states A the image of 
the map A, thus justifying the slight abuse of notation. 

Pure state transformations .— We start by developing 
the theory of pure states transformations; the main result 
in the present context is that in the asymptotic setting 
of many copies this becomes reversible, the rates gov¬ 
erned by the entropy of the decohered state, a quantity 
we dub entropy of coherence. First however, we review 
the situation for exact single-copy transformations. 

We start with a simple observation on ranks: Namely, 
let tp be transformed to if by an incoherent, or more gen¬ 
erally a probabilistic incoherent operation, | ip) oc K\ip) ^ 
0, for an incoherent Kraus operator K = XZi c i|j(*))(*l- 
The rank r of A(<p) is precisely the number of nonzero 
diagonal entries of A(<p), which is the number of nonzero 
terms in |<p) = l*)> 1^1 = r - Thus, 

\if) oc K\ip) = ^ip iCi \j{i)) 
i£R 

has at most r = |i?| terms. This proves the following. 
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Lemma 1 If ip ip, then rankA(^) < rank A (<£>), 
i.e. the rank of the diagonal part of pure states cannot 
increase under incoherent operations. 


Theorem 4 (Yuan/Zhou/Cao/Ma 51) For two 

pure states if and ip and a rate R > 0, the asymptotic 
incoherent transformation 


Theorem 2 (Cf. Du/Bai/Guo jisl ] ) For two pure 
states ip = |'0)(' i / , l an d T — |^)(^|, if A(ip) majorizes 
A(<p), A (ip) >- A (ip), then there is an incoherent (in 
fact, a strictly incoherent) operation transforming ip to 

i IC 1° / 
ip: ip i—> ip. 

Conversely, if ip H? ip, or if <p ^4 ip and in addition 
rankA(yj) = rank A (^>), then A (ip) >- A (ip). 


Here, the majorization relation for matrices, p >- a, 
means that the spectra spec(p) = p = (pi > ... > Pd ) 
and spec(g) = q = (qi > ... > qd) are in majorization 
order 16-181: 


t t d d 

Vi < d ^2,Pi > ^2 q%, and ^Pi = ^ <?,:• 

2=1 2=1 2=1 2=1 

As a consequence of Theorem [2j just as for pure state 
entanglement [l3|, there is catalysis for pure state inco¬ 
herent transformations, cf. 0 ; in [2D . examples such 
that initial and final states have equal rank are given. 
An immediate corollary of Theorem [2] is the following. 

Corollary 3 (Baumgratz et al. i) Let p be an arbi¬ 
trary state in C d , and 4> d = |$dX$d| = \ l*)0l • 

Then, there is an incoherent operation transforming 4V 
to p. 


ip® n V ip® nR as n 00 , e —>■ 0, 

is possible if R < an< ^ impossible if R > cfa) • 

In particular, ip can be asymptotically reversibly trans¬ 
formed into $ 2 , and vice versa, at optimal rate C(ip). 


Here, p ss <7 signifies that the two states have high 
fidelity: F(p,a) > 1 — e (see Appendix A for details). 

Now we are ready to introduce two fundamental tasks 
for arbitrary mixed states, namely asymptotic distilla¬ 
tion of p® n to 4>® ni? and the reverse process of formation 
p® n from $f nR . Note in this respect the fundamental 
importance of Theorem |4j which shows that we could 
equivalently elect any pure state ip (that is coherent) 
as unit reference for formation and distillation, and all 
rates would change by the same factor It turns 

out that both quantities have single-letter, additive ex¬ 
pressions: the former is given by the relative entropy 
of coherence , the latter by the coherence of formation', 
both are additive. This is in marked contrast to other 
resource theories, perhaps most prominently that of en¬ 
tanglement under LOCC, in which the basic operational 
tasks are only characterized by regularized formulas, and 
the fundamental quantities, such as entanglement of for¬ 
mation 241, relative entropy of entanglement 0 ], etc, 
are 


not additive 12! 0 . 


This motivates the name maximally coherent state 
for 4v. In addition to enabling the creation of arbi¬ 
trary d-dimensional coherent superpositions by incoher¬ 
ent means, 4v also allows the implementation of arbi¬ 
trary unitaries U € SU(d) jB:). Then, fixing the qubit 
maximally coherent pure state $2 = \ X^=o l*)0l as 
a unit reference, we are ready to consider asymptotic 
pure states transformations with vanishing error as the 
number of copies goes to infinity. Special cases of this 
are coherence concentration, the transformation from a 
non-maximally coherent pure state to the unit coherent 
state, and coherence dilution from unit coherent state to 
the non-maximally coherent pure one. As in informa¬ 
tion theory, entanglement theory and other similar cases 
(cf. [2lL l22]b this simplifies the picture dramatically. To 
express our result, we introduce the entropy of coherence 
for pure states as 


C(iP) = S(A(if)). 


Distillable coherence .— The distillation process is the 
process that extracts pure coherence from a mixed state 
by incoherent operations. The distillable coherence of a 
state is the maximal rate at which $2 can be obtained 
from the given state. The precise definition is the follow¬ 
ing. 


Definition 5 The distillable coherence of a state p is de¬ 
fined as 

C d (p) = sup I?, s.t. p® n ~ $® nR as n —> 00 , e —> 0. 


We look at the asymptotic setting to get rid of the error 
e, motivated by general information theoretic practice 
21, 22| and the success of this point of view in the pure 
state case (Theorem 0]) • By definition, Cd naturally has 
an operational meaning as the optimal rate performance 
at a natural task. Theorem ED shows that the distillable 
coherence is given by a closed form expression. 


Here S(p) = — Trplogp is the von Neumann entropy, 
where logarithms are to base 2. Note the maximum value 
of this functional among d-dimensional states is attained 
on 4v: C($d) = logd, in particular C(4>2) = 1 for the 
unit coherence resource. 


Theorem 6 For any state p, the distillable coherence is 
given by the relative entropy of coherence: 

Cd{p) = C r (p) = minS(p||<r). 
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The relative entropy of coherence is introduced and 
studied in detail in [413!■ Here, 5 '( / o||ct) = Trpflogp — 
log a) is the quantum relative entropy. In 3, Il|) it is 
shown that 

CM = s(A(p)) - S{p). (1) 

Furthermore, that the relative entropy of coherence is 
convex in the state, and a coherence monotone, mean¬ 
ing that for every incoherent transformation T(p) = 
^T,e KtpKf, C r (p) > C r (T(p)). In fact, it is even strongly 
monotonic Q: 

C r (p) > ^2peC r (pe), where pepe = K e pKj. 

But it is only due to Theorem [S] that we can give it 
a clear operational interpretation in the distillation pro¬ 
cess. Note that in concurrent independent work, Singh et 
al. (28j show that the same quantity, C r {p), by virtue of 
it being equal to the entropy difference S(A(p)) — S(p), 
arises also as the minimum amount of incoherent noise 
that has to be applied to the state to decohere it. 

Coherence cost- The formation process is that which 
prepares a mixed state by consuming pure coherent states 
under incoherent operations. The coherence cost is the 
minimal rate at which $2 has to be consumed for prepar¬ 
ing the given state. 

Definition 7 The coherence cost of a state p is defined 
as 

C c (p) = inf R , s.t. <frf nR ft p® n as n —»■ 00 , e —> 0. 

The next result shows that the coherence cost has a 
single-letter formula, involving a simple entropy opti¬ 
mization. 

Theorem 8 For any state p, the coherence cost is given 
by the coherence of formation, 

C c (p) = C f (p), 

where the coherence of formation is given by 

Cf(p) = min E Pi/S , (A('i/)j)) S.t. p — ^ ] P/ I'f'ifi'fi 1 • 

i i 

The coherence of formation is introduced as a mono¬ 
tone in 3, where its convexity and monotonicity under 
incoherent operations were observed (see Lemma [13] in 
Appendix A), however its additivity was not remarked. 
A priori, one might have expected the cost to be given by 
the regularization of the coherence of formation, which 
would have involved infinitely many optimization prob¬ 
lems so that the evaluation of the operational cost had 
become infeasible. Indeed it is additivity that makes the 
single-letter formulas in Theorem [6] and Theorem [8] avail¬ 
able, and because of its importance we record it as a 
theorem. 


Theorem 9 Both Cf and C r are additive under tensor 
products, 

C f (p®v)=C f (p) + C f (<r), 

Cr{p ®<j)= C r (p) + C r (<j). 

Theorem 0 means that also the distillable coherence 
and the coherence cost are additive, and so there are 
no super-additivity or activation phenomena in the re¬ 
source theory of coherence, unlike the theory of en¬ 
tanglement [m . 27], that of communication via chan¬ 
nels [27l . [29j|, and many other resource theories, where 
the yield (cost) of two resources together may be strictly 
larger (smaller) than the sum of the yields (costs) of the 
resources processed individually. 

Based on the formulas for the distillable coherence and 
the coherence cost, we are now ready to characterize pre¬ 
cisely the (ir-) reversible states. 

(Ir-)reversibility. — From the definitions of Cd and C c , 
it is immediate that Cd{p) < C c (p). A state is re¬ 
versible if the equality holds, otherwise it is called ir¬ 
reversible. From Theorem |U we see that all pure states 
are asymptotically reversible, the optimal transformation 
rate given by the ratio of their entropies of coherence. In 
the mixed state case, Theorem ITOl provides a simple cri¬ 
terion to decide whether the given state is reversible or 
not that completely characterizes all the reversible states. 
From this, we conclude that the mixed states are gener- 
ically irreversible and the coherence theory is an irre¬ 
versible resource theory. However in contrast to the irre¬ 
versibility in entanglement theory SHU] , there is no 
“bound coherence” (as an analogue of “bound entangle¬ 
ment” S- Zl)> from which no coherence could be distilled, 
but for which, in order to create it, nonzero coherence 
would be required. 

Theorem 10 A mixed state is reversible if and only if 
its eigenvectors are supported on the orthogonal subspaces 
spanned by a partition of the incoherent basis. That is, 

3 

and each \ cfj) £ TLj = span{|«) : i £ Sj}, such that 
Sj (~l Sk = 0 for all j k and TL = @jTLj. In other 
words, p is reversible if and only if there exists an inco¬ 
herent projective measurement, consisting in fact of the 
projectors Pj onto TLj, s.t. p = JT PjpPj, and PjpPj is 
proportional to a pure state. 

Note that the criterion is easy to check, and contrast 
this with the entanglement irreversibility of two-qubit 
maximally correlated states considered in 31], where 
Wootters’ formula [32| for calculating the entanglement 
of formation is used: Here we do not need the explicit 
formula of the coherence of formation, which involves an 
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optimization problem itself, and indeed we do not know 
a closed-form expression for it in high dimension. How¬ 
ever, here the equality constraint is so severe that we can 
learn the structure of the state. 

Theorem 11 There is no bound coherence: Cd(p ) = 0 
implies C c (p) =0. In other words, every state with any 
coherence (non-zero off-diagonal part) is distillable. 

Discussion .— We have shown that the incoherent op¬ 
erations proposed by Baumgratz et al. [fj give rise to 
a well-behaved operational resource theory of coherence. 
Remarkably, almost all basic questions in this resource 
theory have simple answers. We saw that it is a the¬ 
ory without bound coherence, but exhibiting generic ir¬ 
reversibility for transformations between pure and mixed 
states. This should be contrasted with the general ab¬ 
stract framework of Brandao and Gour [33], which ap¬ 
plies to the present theory of coherence, but rather than 
the incoherent operations considered here, requires all 
cptp maps £ such that the weaker condition £(A) C A 
holds. Both the distillable coherence and the coherence 
cost under this relaxed premise become C r (p), meaning 
that while we cannot distill more efficiently with this 
broader class of operations, formation becomes cheaper. 

One curious observation, for which we would like to 
have a strictly operational interpretation, is that the re¬ 
sulting theory of coherence resembles so closely the the¬ 
ory of maximally correlated entangled states. Indeed, 
under the correspondence 

P = ^2Pij\iXj\ <—» P = ^2Pij\HYj3l 
ij ij 


C{ip ) is identified with E(ip), the entropic measure of 
pure entanglement, C c (p) = Cf(p) is identified with 
the entanglement cost (which equals the entanglement 
of formation for these states), E c (p) = Ef(p), and 
Cd{p) = Cr{p) i s identified with the distillable entan¬ 
glement (which equals the relative entropy of entangle¬ 
ment for these states), Ed{p) = E r (p) [34[. Indeed, this 
answers all the basic asymptotic questions in the theory, 
which is much simpler than general entanglement theory. 
What is missing to elevate this correspondence from an 
observation to a theoretical explanation (cf. Streltsov et 
al. 0) is a matching correspondence between incoher¬ 
ent operations and LOCC operations. That would truly 
show that the two theories are equivalent, relieving us 
of the need to prove any of the optimal conversion rates 
reported in this Letter, which now we have to do by man¬ 
ually adapting the entanglement manipulation protocols. 
A notable gap in the above correspondence is the non- 
asymptotic theory of pure states: The diagonal entries of 
a pure state correspond to the Schmidt coefficients of the 
associated pure entangled state, and by Nielsen’s theo¬ 
rem [l8| a pure state can be transformed by LOCC into 


another one if and only if the Schmidt vectors are in ma- 
jorization relation. In the case of incoherent operations 
on pure states, we only know that majorization is suffi¬ 
cient for transformability but not whether it is necessary. 

To study this correspondence further, it might be 
worth investigating the optimal conversion rates R of in¬ 
coherent transformations 

p<S>n | ICj 


for general (mixed) states, and which for the moment we 
can only bound using the existing monotones: 

< 7? < min / £M\ 

C f (a)~ ~ {C r (ayC f (a)j- 


Given the close resemblance to entanglement theory, 
we may expect that one-shot coding theorems as well 
as finite block length analyses can be carried out, but 
we have refrained from entering this domain to maintain 
the simplicity of the asymptotic picture. As in one-shot 
information theory [35: ] . we expect that min- and max- 
entropies and relative entropies and Renyi (relative) en¬ 
tropies govern the optimal rates, which now would carry 
an explicit dependence on the protocol error. 

It remains to be seen whether similarly complete the¬ 
ories of asymptotic operational transformations can be 
carried out for other resource theories, such as that of 
reference frames 12]. Observe that as reference frame 
theories are built on group actions under which the free 
states are precisely the invariant ones, the present the¬ 
ory of coherence may be viewed as the special case of the 
group of the diagonal phase unitaries. 
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APPENDICES 

A. Miscellaneous facts and lemmas 

In this appendix, we collect standard facts about var¬ 
ious functionals we use and show some lemmas that we 
need in the proofs of the main results. 

We use the Bures distance B based on the fidelity F, 
but it is essentially equivalent to the trace norm: 

B{p,cr) = F(p,a), 

F(p, a) = Tr \Jp 1 l 2 op 1 l 2 . 

Namely [3b| . 

7 }B(p, a) 2 < i||p-cr||i < B{p,a). 

In the proof of Theorem [S] below, we need the asymp¬ 
totic continuity of C r . 


two states p and cr on a d-dimensional Hilbert space such 
that ^||p — cr|| i < p, then \S(p) — S' (cr) | < 77 log d + h(p), 
we have 

I C r (p) - C r (a )| = \S(A(p)) - S(p) - (S(A(a)) - S(a))\, 
<\S(A(p))-S(A(a))\ + \S(p)-S(a)\, 
< elogd + 2 /i(e/ 2 ), 

concluding the proof. □ 

In the proof of Theorem [5] below, we need the mono¬ 
tonicity of the coherence of formation, 

Cf(p) = min^pjS(A(^j)) s.t. p = ^pjipj 

i i 

under incoherent operations. 

Lemma 13 (Aberg [dj) The coherence of formation is 
convex and a strong coherence monotone: Indeed , for 

p^A a = '^2 K ?P K \ = ^Peve, 
t t 

it follows that 

C f (p) > ^2p t C f {ae) > Cf(a). 

t 

Proof. As a convex roof (aka convex hull), C/ is au¬ 
tomatically convex. Furthermore, because of the convex 
roof property, we need to prove the strong monotonic¬ 
ity only for pure states. Consider a pure state if and 
an incoherent operation £ whose Kraus operator set is 
{K(}, where each Kt is incoherent operator, satisfying 
= 1 . Suppose Kn\ff) = yjpl from this we 
construct a new incoherent operation £ whose Kraus op¬ 
erator set is {|£)® K(} where the |£) are the basis states of 
an ancillary system. From the monotonicity of the rela¬ 
tive entropy of coherence under the incoherent operation 
£, we obtain 

C r (lP) > C r (£(4 0) = r(V><) > Cf {£{$)), 

t 


Lemma 12 The relative entropy of coherence is asymp¬ 
totically continuous, i.e. for ||p — cr||i < e, 

| C r (p) - CV(cr)| < elog d + 2 d(e/ 2 ), 

where d is the dimension of the supporting Hilbert space , 
and h(x) = —xlogx — (1 — x)log(l — x) is the binary 
entropy function. 


and we are done. □ 

Our proof for the asymptotic continuity of coherence of 
formation comes from that of entanglement of formation. 

Lemma 14 (Nielsen |39l| . Winter |4ol] ) For two bi¬ 
partite states p and a supported on a d x d-dimensional 
Hilbert space with Bures distance B(p,a) < e < 1, then 


Proof. The proof is straightforward because of Eq. ©: 
From ||p — cr11 1 < e, we get ||A(p) — A(ct)||i < e by the 
monotonicity of the trace distance under the dephasing 
operation A, which is a cptp map. By Audenaert’s im¬ 
provement 37]] of Fannes’ inequality [381 ]. which states for 


\ E f(p)~ E f{o)\ < elogd+(l+e)/i (y 


□ 


In the proof of Theorem [ 8 l we need the asymptotic 
continuity of Cf. 







Lemma 15 The coherence of formation is asymptoti¬ 
cally continuous, i.e. for B(p,a) < e < 1, 

\ c f(p)- c f(<r)\ < elogd + (1 + e)/i > 

where d is the dimension of the supporting Hilbert space. 
Proof. Notice two facts: One is that 

||p — cr||i = ||cnot(/ 9® |o)(o|)CNOT t 

-CNOT(tr® |0)(0|)CNOT t || 1 , 

the bipartite states on the right hand side being sup¬ 
ported onadx d-dimensional Hilbert space, and the 
other is that C / (p) = E f (CNOT(p® |0)(0|) CNOT 1 ) Q. 
From these and Lemma [HI the claim follows. □ 

In the proof of Theorem [Gl we need the following slight 
modification of [U Prop. 2.4]. 


Lemma 16 (Cf. Devetak/Winter |4ll Prop. 2.4]) 

For a family {W x } xe x of quantum states on a d- 
dimensional Hilbert space TL, and the type class 
Tp = {W x n} w.r.t. an empirical type P of length-n- 
sequences over X, let (Yi,..., Yg) be a random tuple 
whose elements are sampled from Tp uniformly without 
replacement. Define the average state 


a(P) 



E W"- 

x n eTp 


Then, for every 0 < e, S < 1 and sufficiently large n, 


Pr 


S 


E*w( p ) 


3 =1 


> e 


< 2 d n exp -St" 


288 In 2 


where t = 2~ I ( X:W '> +S and I{X : W) = S{X ) + S{W) - 
S(XW) = S(JP P x W x ) — J2 P xS{W x ) is the mutual in¬ 
formation evaluated on the state u> = Px\T){x\ ® W x . 


Proof. Prop. 2.4 in |41| follows directly from the ma¬ 
trix tail bound in [42[ when the matrices are sampled 
uniformly with replacement, that is, Yj are i.i.d. random 
variables. As pointed out in [4^], the matrix tail bound 
in [42| still holds when the Yj are sampled uniformly with¬ 
out replacement 44|. So the modified [4l|, Prop. 2.4] also 
holds. □ 


In the proof of Theorem [TO], we need the following 
lemma, which comes essentially from the structure of the 
state with vanishing conditional mutual information [45[ . 


Lemma 17 (Winter/Yang ]46|) A state pab in the 

finite dimensional Hilbert space TLa ® Pb satisfying 


S(B) ~ S{AB) = E f (AB) 


is of the form 


P 


AB 




AB 

where <f> i ' are pure states and system B is decomposed 
as a direct sum of tensor products: TLb = © Pb^ ®Pb 11 ■ 


B. Proofs 

Here we provide the detailed proofs of the results in 
the main text. 

Proof of Theorem [2] Assume spec A^) = p >~ q = 
spec A (ip), and without loss of generality, 

IV>) = E V'Pi'l*). b)=Ev^i»). 

i i 

since the diagonal unitaries required to adjust the phases 
are incoherent operations. 

It is well-known ]16] that majorization implies that 
there is a probability distribution A „ over permutations 
7 t £ Sd such that 


<f= E 

7T 

where jT is the vector p with indices permuted according 
to 7r: fT(i) = Pir(i)- Observe that we may without loss 
of generality assume the pi and qi to be ordered non- 
increasingly, and also w.l.o.g. that all qi > 0, otherwise 
we reduce the Hilbert space dimension d. 

Now, define the Kraus operators 

Kir ■= E V / Ar 177-(z))(z|, 

V Qi 

which are evidently incoherent, and define a cptp map: 

Ea1^ = E a ^I*X*I = 1 - 

Qi 

tv in 

Furthermore, it effects the desired transformation: 

KM = E v / 'W^w1 7r (*)) = 


for every permutation n. 

Conversely, suppose 1 —> ip by the incoherent op¬ 
eration with Kraus operators {iQ}, then Kg\<p) oc | ip) 
for every i. Since rankA(-i/>) = rankA((/?), we arrive at 
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Kg = JA ce(i)\iT£ (z)){i|, with a permutation m on the ba¬ 
sis states. The same form obtains from the assumption of 
a strictly incoherent map. That is, cg(i)y/ql |7iy(i)) oc 

Hi y/Pi I*)- Now we show that 

I ip') AB L ^ C W) AB , 

where 

W) AB = CNOT(|^|0) S ) = Y v^l*> A |*) B , 

i 

W) AB = CNOT(|i/>) j 4 |0) b ) = ]T -Jpl\i) A \i) B . 

i 

Namely, Alice performs the measurement with Kraus op¬ 
erators {Kg} and tells the outcome £ to Bob, who then 
performs the permutation transformation ng on system 
B. Thus they end up with the state 

K a ® 7rf \p’) AB = Y c *K*)v / ^M*)) A M*))' B 

i 

^Y^Pi\i) A \i) B = W) AB - 

i 

I.e., | tp') AB can be converted to by an LOCC op¬ 
eration; from [lgj, we get that A(i/>) >- A (ip). □ 

Remark In [l5[ , it was claimed by a different approach 
that majorization A (i/>) >- A(</?) follows already from a 

IC 

general incoherent transformation ip i—>■ %/j. However, it 
seems that the proof has a gap, by way of some unjustified 
assumptions. For the moment, Theorem [2] is the best 
available statement. 

IC 

Proof of Corollay [3j If p is pure, then i —> p 

follows from Theorem [2] 

Any mixed state is a convex combination of pure 
states, ij)i with probability weight pi , each of which can 
be obtained from by an incoherent map Then, 
£ = HiPi^i i s also incoherent and p = £($d)- □ 

Proof of Theorem [4j Without loss of generality, we 
suppose a general pure state in d-dimensional Hilbert 
space in the form | if)} = JA =1 v^l*) w ^h a probability 
distribution (qg). The state of n copies of \iJj) is 

I ip)® n = Y •••*„|*i •■■*„}. 

i 1 • ••‘in 

We shall use information theory abbreviations for a 
generic sequence of length n, i n = i\ .. .i n , the proba¬ 
bility qtn = ■■■ q in and the state \i n ) = \i\ ... i n ) j f2lj . 

In analogy to entanglement concentration and dilution 
for bipartite pure state, we establish the conversion rate 
between the state | ip) and the unit coherence state | ( f> 2 ). 

Concentration: Consider the coefficients and the basis 
sequences of n copies of the state | i/j), we perform the 


type measurement {Mp, P £ V n }, where V n is the set of 
types of sequences with length n , and 

M P = Y l*i • ■ -*„X*i ■ • -*n|, 

i»£T(P) 

with T(P) the type class of P , i.e. the set of sequences 
i n in which the relative frequency of each letter i equals 
P(i). By the law of large numbers, with probability con¬ 
verging to 1, we get a type P xs Q (these are called typ¬ 
ical ), which hence has the property that H{P) « H(Q), 
where H ( P ) = — JA P(i) log P(i) is the Shannon entropy 
of the distribution P. Notice that this measurement is 
an incoherent operation, and that the output state af¬ 
ter application of Mp is the maximally coherent state on 
the subspace spanned by the type class T(P), which has 
dimensionality 

2 uH(p) > \T(P)\ >{n+ i)~ d 2 nH ( p \ 


so by relabelling the indices of the state - which can 
be realized by an incoherent unitary transformation -, 
the state can be transformed to with H(P) > 

R > H(P) - d log( " +1) , which tends to H{P) « H(Q) = 
when n —»■ oo. 

Dilution: Consider the coefficients and the basis se¬ 
quencers of n copies of the state | ip). We decompose the 
state into two terms 

I*/*) 0 ” = ^/Pi-(T Q " 5 )|typ) + ^l-Pr(T^)latyp), 
with the typical part of the state, 

|typ) = , 1 YV ( Hi'" ( Hn l*i ■■•*»), 

AWV 

summing over all i,i ■ ■ ■ i n £ Tq s , and an atypical rest 
|atyp). Here, Tq s the set of (entropy) typical sequences 
defined as 


- 7 -n ._ 

>Q,s — 



— il . . .in 


-log {qi- 

n 


■Qin )~ H (Q) 



By the law of large numbers, for any e > 0 and suffi¬ 
ciently large n, Pt(Tq s ) > 1 — £• Furthermore, \Tq S \ < 
2 n(H(Q)+6)' ( jq u1 -j 0I1 protocol uses then n(H(Q) + <5) 

copies of |$ 2 ), whose coefficients are uniform in dimen¬ 
sion 2 n ( H (Q' >+s ^ and by Theorem [2] this state can be de¬ 
terministically transformed by an incoherent operation to 
any other pure state in dimension 2 n ^ H ^' ,Jr& \ So we can 
prepare the state |typ) satisfying F(|typ)(typ|, ip® n ) > 
\/l — e, and the required rate of qubit maximally coher¬ 
ent states |$ 2 ) tends to H(Q) = S(A(i/))). 

Optimality: We show both at the same time. Namely, 
consider a pure state \ip) and asymptotic transformations 

and Q®nR'+o(n) ^_ We know 

already that R = = R' are achievable. Assume 
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by contradiction that one of the two can be improved, 
i.e. R > achievable for concentration or R' < 

is achievable for dilution. Thus, we can have R = > 

1. Then, by composing the dilution and concentration 
protocols, we get an incoherent transformation 

,p» n , IC ; ^ ^nR-o(n) 


The other, “bad”, subsets are few. Relabelling the in¬ 
dices i n O (m, s), we can write Eq. © as follows. 

M S 

I (t>{Q)) AE = -rjj E ® ~r§ E l s )l^)- 

v m =1 v s=l 

Now introduce 


for large n. But by Lemma [TJ each Kraus operator Kg 
of this transformation must produce a state \ipe) with 
rank Afy/y) < 2 n . From this one can readily derive 

F 2 (i/j e ,^ R ~ o{n) ) = Tr^i , 2 fl_o( " ) < 2"” (fl_fl) “ o(n) , 

and by convex combination, the output of any incoher¬ 
ent operation on &f n has exponentially small fidelity 

< with This contradiction 

shows that necessarily R < C(i/j) < R'. □ 

Proof of Theorem© The proof consists of two parts, 
the direct part showing that the claimed rate is achiev¬ 
able, and the converse part showing that no more can 
be distilled. In the direct part, we use the typicality 
technique and a covering lemma on operators. For the 
converse part, we follow the standard argument where we 
need the monotonicity, asymptotic continuity, and addi¬ 
tivity of relative entropy of coherence. 

Consider the purification 

\<t>) AE = ^ 2 y/Pi\i) A \^i) E 

i 

of p, i.e. Tie = p A - We perform the type mea¬ 

surement {Mp, P € V n }, where V n is the set of types of 
sequences with length n, and 

M P = ^2 \ii...in){ii---in\, 

i n €LT(P) 


where T(P) the type class of P (see the proof of Theo¬ 
rem 13]). 

By the law of large numbers, with probability converg¬ 
ing to 1, we get as outcome a typical type Q ~ P, hence 
with the property H{Q) « H(P). Then the bipartite 
state we are left with is 

m))AE = pmm £ ( 2 ) 

v\ 1 W’l i n eT(Q) 


From Lemma Htn we conclude that there exists a partition 
of the type class T(Q) into \T(Q)\/S =: M subsets {S'™} 
with | Sjn j = S such that at least M(l —e) of these subsets 
of {S'™} are “good” in the sense that the average states 
of system E over the subsets is almost the same as a fixed 
state, the average state over T(Q), 


1 

S 


E 


i»es m 


1 

Win 


E *-■ 

i n £T(P) 


\<t>(Q)m) : = -EE l s )l < ?W; 

v s—1 

by Uhlmann’s theorem [13,[HI, for “good” m, there exists 
a unitary U m on A such that (U m 0 t)\<j>(Q) m ) ~ \4>(Q)o)- 
Now we perform the measurement {P g0 od, Pbad}, and 
with probability 1 — e, we get the state 

E l TO >l <!>{Q)m)- 

good m 

Then we construct the incoherent operation whose Kraus 
operators are of the form 

K s = '22 \m)(m\ 0 |0}(s|[/ m . 

good m 


ViM 


This gives 

k s \HQ)) ae 


,, rn - E l m )l°)( s l < K < 9)o}> 
V M 0 --*) good m 


which shows that we obtain an approximation to the co¬ 
herent state 2m Tnn I m). 

It remains to estimate the quantity M in the above pro¬ 
tocol. Observe 2 nH ^ > \T(Q)\ > (n + l)~ d 2 nH ^ and 
S = 2 nI ( A[E ' l “, where the mutual information is calcu¬ 
lated with respect to the state w = XX ft I I A ® 1 4>i X A I E ■ 
Since Q is a typical type, we have H(Q) rts H{P) = 
S{A(p)), and 


I (A : E) u = S(co E ) « 5 J2p^ = 5 (A 


Thus, we get i log M —> S(A(p)) — S(p) when n —» oo. 
Conversely, for any protocol C n such that 
< e, we have 


nC r (p) = C r (p® n ) (3) 

> C r {C n {p® n )) (4) 

> C r {$f nR )-ne\ogd-2h(e/2) (5) 

= nR — nelogd— 2h(e/2), (6) 


where Eqs. m and © come from additivity and Ineq. © 
is due to monotonicity and Ineq. © due to asymptotic 
continuity. So R < C r (p)+elogd+2h(e/2). When e —> 0 
and R —> Cd{p) as n —> oo, we obtain Cd(p) < C r (p). □ 
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Proof of Theorem [8j The proof consists of two parts, 
the direct part that shows we can prepare the state at 
the claimed rate, and the converse part that says that to 
prepare the state we have to consume $2 at least at that 
rate. The proof is very similar to the entanglement cost 
[49I ] . except that here we have additivity which makes it 
simpler. In the direct part, we just prepare the typical 
part of the state and for the converse part, we still have 
the standard argument where we need the monotonic¬ 
ity, asymptotic continuity, and additivity of coherence of 
formation. 

For an optimal convex decomposition, p = Y^iPifai 
where the indices i range over an alphabet If (w.l.o.g. of 
cardinality |S7| < d 2 , by Caratheodory’s Theorem), we 
have 


P®" = 

i n 

with i n = *i.. ,i n , Pin = p i± ■ ■ ■ p in and fan = fa 1 ® ■ ■ ■ ® 

fa n - 

The set of (frequency-)typical sequences, 


T={i n :Vj |/ J -(* n )-p i |<5 1 }, 

with fj(i n ) = -^\{t : it = j}\, has the property that for 
large enough n, 


Pr(T) > 1 - ci. 

Now we rewrite p®" = p(T) + po, with the sub¬ 
normalized state 

p(T) = ^ fafa(fa"l 

i“£T 

and a rest p<j. The protocol is now to sample an el¬ 
ement i" e T according to the conditional distribu¬ 
tion p® n \r °f P®" restricted to T. The number 

of occurrences of j in each typical sequence i n G T is 
N(j\i n ) 5s n{pj + 5i). By the conherence dilution proto¬ 
col we can prepare a state 

Pj ~ Tj 

by consuming at most n(pj + 6i)(S(A(tpj)) + 6 2 ) copies 
of $ 2 . So we can prepare 




SJ| 


=: Pi 


fa-' 


using at most J+ n{pj + 6 i)(S(A(fa)) + <5 2 ) copies of <J> 2 . 
Now we prepare a mixed state as the convex combination 
of these p^n, i.e. 


P 


(«) 


i 

p®"(T) 


Pi n Pi n * 

i“er 


By the joint concavity of the fidelity [47), |48|, we get 


F(p® n ,p (n) ) > p® n (T)F 


p(T) 




p® n (T) ’ 
> (l- ei )(l-e 2 ) |a| . 


Since |f2| is bounded, when n —> 00 , 61,62 —> 0 and 
F(p® n , p^) — > 1 for 61,62 —> 0. The required rate of 
$2 tends to ^2piS(A(fa)) = Cf(p). This shows that the 
rate C/(p) is asymptotically achievable. 

For the optimality, consider an incoherent protocol 
that produces p ^ = £($f m ), which is close to p® n up 
to error e, i.e. F(p( ra ), p® n ) < e. From the monotonicity, 
asymptotic continuity and additivity of C/(p), we get 


>C f (pM) 

> C f (p® n ) - ne log d — (1 + e)h 
= nCf (p) — ne log d— (1 + e)h 


1+6 
6 


1 + 6 


(7) 

( 8 ) 
(9) 

( 10 ) 


where Eqs. 0 and mi come from the additivity, 
Ineq. © from the monotonicity and Ineq. m from the 
asymptotic continuity. Thus, 


^ > C f (p) - elogd- (1 + e)h £ 


Letting e + 0 as n + 00 , we get C c (p) > Cf(p), as 
advertised. □ 

Proof of Theorem [9j We observe that the coherence 
of formation C/(p) is equal to the entanglement of for¬ 
mation of the bipartite state CNOT(p ® |0)(0|) CNOT^. 
The latter is a maximally correlated state in the sense 
that in any decomposition into pure states ensemble, the 
pure states have the same Schmidt basis. In [31], the en¬ 
tanglement of formation for this class of states is proved 
to be additive. So the coherence of formation is additive. 

Additivity of C r comes directly from Eq. m because 
S(p ® cr) = S(p) + S(er) and S(A(p <8> cr)) = S(A(p)) + 

S(A(<t)). □ 

Proof of Theorem 1101 Given a mixed state p A , 
we construct the bipartite state a AB = CNOT(p A ® 
|0)(0| s )CNOT t which is a maximally correlated state 
of the form Pij\ii)(jj\■ Now the reversibility implies 
that S(a B ) — S{a AB ) = Ef(a AB ). Applying Lemma [171 
to this and noticing that the a A = a B we get 


a AB = \faXfa 


I AB 


where 4> B + (f) B when j ^ k. Then using the CNOT again 


on cr AB we recover the original state p A of the form 


P A = ®Pj \fa)(fa 
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where each \4>j) A is in the subspace Hj = span{|i) : i £ 
Sj} and these subspaces are orthogonal to each other. □ 

Proof of Theorem 111! This follows from Theorem 
1 Cd{p) = C r (p) and the fact that C r {p) is a faithful 
coherence measure in the sense that C r (p) = 0 if and 
only if p is incoherent. □ 

C. Generalized model 

While we restricted our treatment for simplicity of no¬ 
tation to the case that the incoherent states are precisely 
a fixed orthogonal basis of the Hilbert space and con¬ 
vex combinations, we observe that our results carry over 
unchanged to the case of a general decomposition into 
subspaces H = ©, Hi. 

In generalization of the picture in the main body of 
the present work, with an orthogonal basis and its su¬ 
perpositions, and inspired by Aberg [dj, let us consider 
an orthogonal decomposition of the Hilbert space H into 
eigenspaces of some observable 

O = ^2 EiPi , so that H = (J) H z , 

i i 

with an orthogonal direct sum, and Pi is the projector 
onto the subspace Hi oiH. With respect to this decom¬ 
position, we declare states as incoherent that respect the 
direct sum decomposition: 

A := i^p = 'Y^q z p i : P z piP z = pi^j , (11) 

where the p,; are states (supported on Hi) and (g*) is a 
probability distribution. For the case of rank-one pro¬ 
jectors Pi = |i)(i| we recover the notion of Baumgratz et 
al. @. 

Slightly abusing notation, we also introduce the deco¬ 
hering projection map 

A{A)=Y J Pi Ap i , (12) 

i 

such that A = A (S(H)) is the image of the state space 
under the decohering map; the image of the set of all 
operators on H we denote by 

®:=A (B(H))=@B(Hi). 


We call an operator K acting on H = © ; Hi, or more 
generally mapping H to K, = ©^ /©, incoherent (IC) if 
KHi C /Cj(j) for every i and a function i j = j(i). 
If j (i) is injective - which for operators mapping H to 
itself means that it is a permutation , we call the op¬ 
erator strictly incoherent (strictly IC). The operator K 
is strictly incoherent if and only if K as well as are 
incoherent. When composing systems, each with its own 
orthogonal decomposition, H = © { - Hi and H’ = ©■ H (, 
we consider the tensor product space with the decompo¬ 
sition H®H f = ©j ■ Ht®H'y In this way, the decohering 
map on the composite space becomes the tensor product 
A (g> A'. 

With this, following Q, we can introduce the sets of 
incoherent and strictly incoherent operations as special 
cptp maps: 


1C := jr cptp: T(p) = ^ K a pK t, Va K a IC j , 

ICq := < T cptp: T(p) = ^ K a pK 1, Va K a strictly IC 

l a 

By slight abuse of notation, we will write for a incoherent 
(strictly incoherent) Kraus operator K, that K £ 1C 

(KelCo). 

Finally, the non-coherence-generating maps according 
to Brandao and Gour [33] are 

1C ■.= {T cptp: T(A) C A} , 

so that ICq C 1C C 1C. 

Our main results, Theorems mm m im and the addi¬ 
tivity Theorem [9j remain unchanged, as one can check. 

Remark All of the transformations or asymptotic trans¬ 
formations contained in the above mentioned theorems 
can be effected by strictly incoherent operations (1C o), 
except the distillation of mixed states. Indeed, it is not 
clear whether or not the rate C r (p) is attainable with 
strictly incoherent operations. Our protocol for Theo¬ 
rem 0 at any rate, uses 1C in a non-trivial way. 


